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CLAIMS 

What is claimed is: 



1 1 . A speech recognition apparatus comprising: 

2 a transformation processor configured to transform at least one phoneme 

3 sequence included in speech into at least one word sequence, and to provide, for said 

4 word sequence, an appearance probability indicating that said phoneme sequence 

5 originally represented said word sequence; 

pp a renewal processor configured to renew said appearance probability, provided 

CT? for said word sequence by said transformation processor, based on a renewed 

In 

0 numerical value indicated by language models corresponding to said word sequence 

*2) provided by said transformation processor; and 

mPd a recognition processor configured to recognize speech by selecting one of said 

i*i word sequences for which the renewed appearance probability is the highest to indicate 

jjfe that said phoneme sequence originally represented said selected word sequence; 
jb wherein said renewal processor calculates said renewed numerical value using a 

34 first language model, which is especially prepared for expressions unique to 

15 spontaneous speech, and a second language model, which differs from said first 

16 language model, and employs said renewed numerical value to renew said appearance 

17 probability. 

1 2. The speech recognition apparatus according to claim 1 , wherein said first 

2 language model represents a probability that said word sequence, which includes a 

3 predetermined word included in an expression unique to spontaneous speech, is said 

4 word sequence or said originally represented phoneme sequence. 

1 3. The speech recognition apparatus according to claim 2, wherein, when said 

2 predetermined word is included in the speech recognition results, said transformation 

3 processor transforms said phoneme sequence into a word sequence included in said 
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4 predetermined word, and said renewal processor renews the appearance probabilities 

5 of said word sequence based on said first language model and said second language 

6 model. 

1 4. The speech recognition apparatus according to claim 1 , wherein said first 

2 language model employs, as an element, a word set including a disfluency. 

1 5. The speech recognition apparatus according to claim 1 , wherein said first and 

JTI said second language models are defined as N-gram models, and wherein said renewal 

CI processor employs, as said renewed numerical value, the weighted average value of 
said first and said second language models. 

i 6. A computer system comprising the speech recognition apparatus according to 

5^2 claim 1 . 

ru 

Jpi 7. A speech recognition method comprising: 

«2 transforming at least one phoneme sequence included in speech into at least 

3 one word sequence, and providing an appearance probability for said obtained word 

4 sequence indicating that said phoneme sequence originally represented said word 

5 sequence; 

6 renewing said appearance probability provided for each of said word sequences 

7 when said word sequence obtained at said transformation step include a word unique to 

8 spontaneous speech, referring to a first model, which is especially prepared for an 

9 expression unique to spontaneous speech, and a second model, which differs from said 

10 first model; and 

1 1 recognizing said speech and selecting one of said word sequences for which the 

12 renewed appearance probability is the highest, to indicate that said phoneme sequence 

13 originally represented said selected word sequence. 
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1 8. The speech recognition method according to claim 7, further comprising: 

2 writing said first language model in correlation with the appearance probability of 

3 a word sequence which includes a word unique to spontaneous speech with a 

4 combination of N consecutive words. 

1 9. The speech recognition method according to claim 8, wherein said word unique 

2 to spontaneous speech is a disfluency. 

9 10. The speech recognition method according to claim 7, wherein said renewing step 

d further comprises: 

in 

H renewing said appearance probability provided for said word sequence by 

g| referring to a third language model, which is especially prepared for specific symbols 
included in said word sequence or sequences. 



jf 1 11. A program that permits a computer to perform: 

J2 acoustically analyzing speech data and transforming said speech data into a 

feature vector; 

4 generating acoustic data, for which an appearance probability is provided, for at 

5 least one phoneme sequence that may correspond to said feature vector obtained from 

6 said analyzing step; 

7 transforming said phoneme sequence into at least one word sequence, while a 

8 disfluency is included as a word choice selection; 

9 renewing said appearance probability by referring to a disfluency language 

10 model that is written by correlating the appearance probability of at least one word 

11 sequence in which a disfluency is included with a combination of N consecutive words; 

12 and 

13 speech recognizing said speech data by using as a speech recognition result 

14 one of said word sequences for which the renewed appearance probability is the 

15 highest. 
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1 1 2. The program according to claim 1 1 , wherein said transforming step further 

2 comprises: 

3 adding a symbol to a word indicating that said word is a disfluency choice to 

4 distinguish said word choice from another word. 

1 13. The program according to claim 1 1 , wherein said speech recognizing step further 

2 comprises: 

S outputting said word sequence to which the highest appearance probability 

C| applies as text data. 

f 2i 14. The program according to claim 12, wherein said speech recognizing step further 

C2 comprises; 

a: 

removing said word to which said symbol has been added from said word 

JlH sequence having the highest appearance probability, and outputting the resultant word 

,p5 sequence as text data. 

RJ 

1 15. The program according to claim 1 1 , wherein said renewing step further 

2 comprises: 

3 renewing said appearance probability by referring to said disfluency language 

4 model and to a general-purpose language model. 

1 1 6. The program according to claim 1 1 , wherein said transforming step further 

2 comprises: 

3 transforming said phoneme sequence into a word sequence, while a pause 

4 included in said speech data is included as a punctuation choice, and renewing said 

5 appearance probability by further referring to a punctuation language model that is 

6 limited to punctuation insertion. 
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1 17. A machine-readable storage, having stored thereon a computer program having 

2 a plurality of code sections executable by a machine for causing the machine to 

3 perform the steps of: 

4 acoustically analyzing speech data and transforming said speech data into a 

5 feature vector; 

6 generating acoustic data, for which an appearance probability is provided, for a 

7 phoneme sequence that may correspond to said feature vector obtained by said 

8 analyzing step; 

!f| if a disfluency is to be reflected in a recognition result, transforming said 

f§ phoneme sequence into a word sequence, with a disfluency being included as a word 

p choice, or if a disfluency is not to be reflected in a recognition result, transforming said 

% phoneme sequence into a word sequence without including said disfluency as a word 

# choice; 

K." 

f|4 if a disfluency is to be reflected in a recognition result, renewing said appearance 

^$ probability by referring to the general-purpose language model and a disfluency 

Jb language model, which is written by correlating the appearance probability of said word 

f j 

3? sequence that includes a disfluency with a combination of N consecutive words, or if a 

18 disfluency is not to be reflected in a recognition result renewing said appearance 

19 probability by referring to the general-purpose language model; and 

20 speech recognizing said speech data by using said word sequence for which the 

21 renewed appearance probability is the highest as a speech recognition result. 

1 18. The machine-readable storage according to claim 17, wherein said transforming 

2 step further comprises: 

3 adding a symbol to a word indicating that said word is a disfluency word choice 

4 to distinguish between said word and another. 

1 1 9. The machine-readable storage according to claim 1 7, wherein said disfluency 

2 language model and said general-purpose language model are N-gram models; and 
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wherein, at said renewing step, said appearance probability is renewed by using the 
weighted average value of said disfluency language model and said general-purpose 
language model. 

20. The machine-readable storage according to claim 17, wherein said transforming 
step further comprises: 

transforming said phoneme sequence into a word sequence, while a pause is 
included as a punctuation choice in said speech data, and renewing said appearance 
probability by referring to a punctuation language model that is limited to punctuation 
insertion, to automatically insert punctuation in a speech recognition result. 
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